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Introduction 


The  purpose  of  this  proposal  is  to  provide  insight  into  gene-environment  interactions  in  a 
population  of  military  working  dogs  (MWD).  It  leverages  the  simplified  genetics  and  detailed 
records  of  the  MWD  population.  There  are  several  critical  aspects  to  meeting  the  aims  of  this 
proposal.  1)  development  of  data-driven  selection  criteria,  2)  biological  sampling  of 
representative  dogs,  and  3)  generation  of  mathematical  methodologies  capable  of  handling 
heterogenous  data  and  statistical  tests  in  consistent  manner  and  providing  clear  and 
understandable  results  that  are  biologically  valid.  Each  of  these  criteria  posses  their  own 
challenges  that  must  be  overcome  before  the  project  will  be  successful.  Here  we  will  provide  a 
breakdown  of  the  previous  year’s  work  that  has  occurred  and  document  our  progress  towards 
achieving  the  specific  aims  we  proposed. 
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Body 


Task  1-  Regulatory  Approval:  The  regulatory  approval  is  currently  with  Lackland  AFB  (LAFB). 
We  have  received  Exempt  status  for  our  data  protocol  and  our  biological  sampling  protocol  is 
currently  under  review.  We  also  have  received  approval  from  the  Joint  Pathology  Center  to 
review  their  records.  JPC  has  also  sent  copies  of  all  pathology  reports  associated  with  the  DoD 
puppy  program.  My  personal  involvement  in  year  one  has  been  limited  to  close  personal  and 
electronic/telephone  interactions  with  the  Military  Veterinary  Corp  at  LAFB.  I  visited  LABD 
twice,  evaluated  the  medical  records  system,  and  discussed  future  approaches  to  data  analysis 
with  the  LAFB  veterinarians. 

Task  2-  Data  Capture  of  Veterinary  Records:  We  have  received  data  from  the  TSA.  This  data  was 
used  to  establish  the  preliminary  database.  In  addition  we  have  identified  software  that  will  be 
highly  efficacious  to  capture  data  from  the  pathology  records.  We  have  also  established  all  of  the 
necessary  computational  infrastructure  at  Lackland  AFB.  We  have  a  high-speed  scanner  in  place 
and  a  high-performance  computer  to  process  all  of  the  data  collected.  We  have  also  conducted 
regular  conference  with  Lackland  AFB  and  our  site  visits  have  yielded  additional  collaborations. 

Task  3-Methodology  Development:  Tasks  3a  and  3b  are  complete.  Tasks  3c-3h  are  scheduled  for 
future  reporting  periods.  Task  3g  is  in  progress.  We  have  submitted  articles  and  received 
comments.  We  are  rewriting  the  manuscripts  to  address  reviewers  concerns.  Over  the  past  year 
we  have  met  multiple  relevant  scientific  and  logistic  benchmarks  we  have  also  encountered  and 
solved  several  problems  associated  with  the  study.  Most  notably  we  have  two  publications 
(appendix  1  and  appendix  2)  that  were  submitted  and  we  are  now  making  the  changes 
recommended  by  the  reviewers.  In  regard  to  the  statement  of  work  we  have  met  several  key 
benchmarks  including  development  of  novel  analytic  techniques.  These  techniques  (SOW  Task 
3a-3c)  are  critical  to  the  success  of  this  project.  Specifically  we  have  developed  a  technique  we 
refer  to  as  Variable  Genomic  Intersect  Analysis  or  GIA  for  short.  GIA  overcomes  limitations  of 
traditional  genomewide  genetic  association  studies  by  1)  accounting  for  latent  variables  without 
information  loss  or  distortion,  2)  not  requiring  multiplicity  testing  correction  and  3)  significantly 
increasing  power.  This  method  is  computationally  fast  and  broadly  applicable  to  simple  and 
complex  genetics,  as  well  as  to  genomics.  Although  detailed  in  our  manuscript  it  is  important  to 
include  here  as  well.  Again,  my  personal  involvement  in  this  area  has  been  limited  to  interacting 
with  the  Army  veterinarians  and  assisting  in  protocol  design.  My  participation  in  this  project  will 
increase  markedly  in  years  2  and  3. 


Task  4  and  5 

As  noted  above  we  have  developed  a  highly  sensitive  methodology  capable  of  identifying  genetic 
variation  using  a  small  number  of  animals.  We  have  also  validated  this  methodology  in  two 
separate  populations  l)a  publicly  released  dataset  consisting  of  multiple  breeds  and  2)  a  quasi¬ 
military  population  of  dogs:  the  retired  racing  Greyhound. 
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Besides  Task  three  we  have  also  completed  task  5a.  Osteosarcoma  is  the  most  common 
cause  of  death  in  retired  racing  Greyhounds,  but  is  extremely  rare  in  show  (i.e.;  AKC) 
Greyhounds.  We  collected  samples  from  Osteosarcoma  Positive  Racing  Greyhounds  (n=12)  and 
Osteosarcoma  Negative  Racing  Greyhounds  (n=12)  in  collaboration  with  the  PI.  Samples  from 
AKC  show  dogs  (n=12),  were  collected  in  collaboration  with  the  AKC  at  a  show  in  California. 
Samples  were  shipped  overnight  on  ice  for  processing  in  the  laboratory  at  The  Research  Institute 
at  Nationwide  Children’s  Hospital.  All  samples  were  verified  for  registration  and  pedigree 
information  (for  racing  Greyhounds,  NGA  registration;  for  show  Greyhounds,  AKC  registration) 
and  were  determined  to  be  unrelated  to  at  least  the  level  of  grandparents.  Suspected 
osteosarcomas  were  confirmed  by  a  Veterinary  Oncologist/Pathologist  after  surgical  excision  of 
the  tumor.  All  blood  samples  were  obtained  prior  to  initiation  of  chemotherapy.  For  controls,  we 
invited  owners  of  adopted  retired  racing  Greyhounds  who  were  osteosarcoma  free  and  >10  years 
of  age  to  participate  in  this  research  project.  Interested  owners  were  screened  for  inclusion,  and 
subsequently  informed  consent  for  blood  collection  was  obtained  from  the  owner,  and  sample 
collected  by  a  trained  veterinary  technician  in  1-2  7  mL  BD  lavender  top  tubes. 

Results  of  this  study  are  provided  in  Dr.  Alvarez’s  report.  My  role  in  the  development  of 
this  methodology  has  been  critical.  I  was  able  to  acquire  genetic  material  and  appropriately 
identify  dogs  that  were  ideal  for  our  new  methodology.  My  direct  contribution  was  to  aid  in  the 
development  of  appropriate  animal  protocols  and  to  coordinate  with  members  of  the  military 
clinical  team  at  Lackland  AFB,  as  described  above.  I  was  involved  in  meetings  with  appropriate 
personnel  and  aided  in  the  vetting  of  Mrs.  Michelle  Perez,  our  veterinary  technician  at  LAFB.  I 
has  also  worked  diligently  in  refining  the  DAPER  database  and  selecting  appropriate  criteria  for 
study.  My  primary  role  in  the  study  has  not  occurred  yet.  The  majority  of  my  responsibilities  will 
take  place  in  the  upcoming  years 


Task  6-  Adaptation  of  existing  resources,  data  storage  and  hosting:  There  have  been  two  areas  of 
progress: 

1. )  Development  of  a  Canine  Medical  Record  System 

2. )  Design  of  a  Workflow  for  digitizing  paper  medical  records 

Development  of  a  Canine  Medical  Record  System  (CMRS) 

A  prototype  of  a  research  CMRS  has  been  completed.  Our  research  CMRS  allows  the 
creation,  search,  and  modification  canine  medical  records  based  on  a  group  of  standard  medical 
record  forms  used  in  the  military  like  the  Form  1829,  Immunization  Form,  Death  Certificate 
Form,  and  Master  Problem  List.  The  software  is  meant  as  a  research  tool,  rather  than  an 
operational  tool,  for  viewing,  searching,  and  analyzing  medical  records  that  are  captured  as  part 
of  our  digitization  workflow  described  below.  Our  goal  is  to  populate  the  CMRS  database  with 
data  generated  from  our  automated  digitization  process  described  in  the  next  section.  My 
involvement  in  this  portion  of  the  project  has  been  primarily  as  an  advisor.  Details  of  this  portion 
of  the  project  are  given  in  Dr.  Alvarez’s  report. 

Digitization  Workflow 
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A  preliminary  design  of  a  workflow  to  digitize  (i.e.  make  database  searchable)  an  archive  of 
canine  medical  records  at  Lackland.  The  workflow  is  designed  to  be  automated  and  scalable.  A  critical 
part  of  this  workflow  is  incorporating  third-party  software  for  OCR  (Optical  Character  Recognition),  ICR 
(Intelligent  Character  Recognition)  and  HWR  (Handwriting  Recognition).  A  number  of  various  software 
packages  were  evaluated  and  we  have  settled  on  a  particular  software  vendor,  ABBY  Flexi-capture.  We 
looked  carefully  at  Form  1829,  and  the  Chronological  Medical  Record  form  to  ensure  that  the  third-party 
software  is  able  to  recognize  check-boxes  and  columns  and  do  a  reasonable  job  with  handwriting  -  which 
is  does. 


This  workflow  will  also  incorporate  the  Joint  Pathology  Center  Pathology  Reports  (and 
Consultation  reports).  The  text  will  be  entered  into  a  data  repository  that  will  be  further  processed  by  a 
different  pipeline  to  evaluate  the  quality  of  the  OCR,  ICR,  and  HWR.  The  raw  data  will  then  be  cross- 
referenced  with  a  controlled  vocabulary  (e.g.  SNOMED,  VeNom,  ICD-  9)  and  the  matches,  or 
occurrences  of  controlled  terms  in  the  documents,  will  be  stored  in  an  inverted  index  or  specialized 
database  table.  The  terms  can  then  be  correlated  against  specific  canines  and  stored  in  a  document 
database.  We  have  not  yet  evaluated  any  of  the  various  Controlled  Vocabularies. 


Conclusions 

Although  my  involvement  in  the  first  year  has  been  limited,  we  have  remarkable  accomplishments  to 
show.  I  am  looking  forward  to  additional  participation  in  this  project. 


